National Repository of Grey Literature 4 records found  Search took 0.01 seconds. 
Information Extraction from Wikipedia
Valušek, Ondřej ; Otrusina, Lubomír (referee) ; Smrž, Pavel (advisor)
This thesis deals with automatic type extraction in English Wikipedia articles and their attributes. Several approaches with the use of machine learning will be presented. Furthermore, important features like date of birth in articles regarding people, or area in those about lakes, and many more, will be extracted. With the use of the system presented in this thesis, one can generate a well structured knowledge base, using a file with Wikipedia articles (called dump file) and a small training set containing a few well-classed articles. Such knowledge base can then be used for semantic enrichment of text. During this process a file with so called definition words is generated. Definition words are features extracted by natural text analysis, which could be used also in other ways than in this thesis. There is also a component that can determine, which articles were added, deleted or modified in between the creation of two different knowledge bases.
Classification of Music Files Using Machine Learning
Sládek, Matyáš ; Smrčka, Aleš (referee) ; Janoušek, Vladimír (advisor)
This thesis is focused on classification of music files using machine learning algorithms. Seven classifiers were compared in this thesis, based on classification accuracy and speed. Two feature extraction methods, two feature selection methods and two parameter optimization methods were used. The best classifier proved to be XGBClassifier, which had reached accuracy of 87.56 % on dataset Extended Ballroom Dataset, 64.56 % on dataset FMA: A Dataset For Music Analysis and 83.50 % on dataset GTZAN. This model could be used for playlist creation or music database categorization.
Classification of Music Files Using Machine Learning
Sládek, Matyáš ; Smrčka, Aleš (referee) ; Janoušek, Vladimír (advisor)
This thesis is focused on classification of music files using machine learning algorithms. Seven classifiers were compared in this thesis, based on classification accuracy and speed. Two feature extraction methods, two feature selection methods and two parameter optimization methods were used. The best classifier proved to be XGBClassifier, which had reached accuracy of 87.56 % on dataset Extended Ballroom Dataset, 64.56 % on dataset FMA: A Dataset For Music Analysis and 83.50 % on dataset GTZAN. This model could be used for playlist creation or music database categorization.
Information Extraction from Wikipedia
Valušek, Ondřej ; Otrusina, Lubomír (referee) ; Smrž, Pavel (advisor)
This thesis deals with automatic type extraction in English Wikipedia articles and their attributes. Several approaches with the use of machine learning will be presented. Furthermore, important features like date of birth in articles regarding people, or area in those about lakes, and many more, will be extracted. With the use of the system presented in this thesis, one can generate a well structured knowledge base, using a file with Wikipedia articles (called dump file) and a small training set containing a few well-classed articles. Such knowledge base can then be used for semantic enrichment of text. During this process a file with so called definition words is generated. Definition words are features extracted by natural text analysis, which could be used also in other ways than in this thesis. There is also a component that can determine, which articles were added, deleted or modified in between the creation of two different knowledge bases.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.